Cross-Category Highlight Detection via Feature Decomposition and Modality Alignment

نویسندگان

چکیده

Learning an autonomous highlight video detector with good transferability across categories, called Cross-Category Video Highlight Detection(CC-VHD), is crucial for the practical application on video-based media platforms. To tackle this problem, we first propose a framework that treats CC-VHD as learning category-independent feature representation. Under framework, novel module, named Multi-task Feature Decomposition Branch which jointly conducts label prediction, cyclic reconstruction, and adversarial reconstruction to decompose features into two independent components: highlight-related component category-related component. Besides, align visual audio modalities one aligned space before conducting modality fusion, has not been considered in previous works. Finally, extensive experimental results three challenging public benchmarks validate efficacy of our paradigm superiority over existing state-of-the-art approaches detection.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Image alignment via kernelized feature learning

Machine learning is an application of artificial intelligence that is able to automatically learn and improve from experience without being explicitly programmed. The primary assumption for most of the machine learning algorithms is that the training set (source domain) and the test set (target domain) follow from the same probability distribution. However, in most of the real-world application...

متن کامل

Category Labels Highlight Feature Interrelatedness in Similarity Judgment

When objects carry the same or different label(s), our perception of the similarity of the objects changes. How does this happen? In two experiments, pictures of animal tissues were presented with fictitious labels and participants judged the similarity of the pictures. The perceived similarity increased when the fictitious labels highlight the interrelatedness of features; this effect of label...

متن کامل

Cascaded Face Alignment via Intimacy Definition Feature

In this paper, we present a fast cascaded regression for face alignment, via a novel local feature. Our proposed local lightweight feature, namely intimacy definition feature (IDF), is more discriminative than landmark shape-indexed feature, more efficient than the handcrafted scale-invariant feature transform (SIFT) feature, and more compact than the local binary feature (LBF). Experimental re...

متن کامل

Alignment and category learning.

Recent research shows that similarity comparisons involve an alignment process in which features are placed into correspondence. In 6 studies, the authors showed that alignment is involved in category learning as well. Within a category, aligned matches (feature matches occurring on the same dimension) facilitate learning more than nonaligned matches do (matches on different dimensions), althou...

متن کامل

Stimulus Modality and Perceptual Category Learning 1 Stimulus Modality Interacts with Category Structure in Perceptual Category Learning

Two experiments were conducted that examined information-integration and rule-based category learning using stimuli that contained auditory and visual information. Results suggest that it is easier to perceptually integrate information within these sensory modalities than across modalities. Conversely, it is easier to perform a disjunctive rule-based task when information comes from different s...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i3.25462